Nucleic acid labeling compounds

ABSTRACT

Nucleic acid labeling compounds containing heterocyclic derivatives are disclosed. The heterocyclic derivative containing compounds are synthesized by condensing a heterocyclic derivative with a cyclic group (e.g. a ribofuranose derivative). The labeling compounds are suitable for enzymatic attachment to a nucleic acid, either terminally or internally, to provide a mechanism of nucleic acid detection.

RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 09/952,387, filed Sep. 11, 2001 now U.S. Pat. No. 6,965,020, now allowed; which is a continuation-in-part of U.S. application Ser. No. 09/780,574, filed Feb. 9, 2001, now U.S. Pat. No. 6,596,856; which is a continuation-in-part of U.S. application Ser. No. 09/126,645, filed Jul. 31, 1998, now abandoned; which is a continuation-in-part of U.S. application Ser. No.: 08/882,649, filed Jun. 25, 1997, now U.S. Pat. No. 6,344,316; which is a continuation of PCT/US97/01603, filed on Jan. 22, 1997 designating the Unites States of America, which claims priority from U.S. Provisional Application No. 60/010,471 filed on Jan. 23, 1996 and U.S. Provisional Application No. 60/035,170, filed on Jan. 9, 1997, all of which are herein incorporated by reference in their entirety for all purposes.

STATEMENT OF RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH

This invention was made with Government support under contract 70NANB5H1031 awarded by the Advanced Technology Program of the National Institute of Standards and Technology. The Government has certain rights in this invention.

BACKGROUND OF THE INVENTION

Gene expression in diseased and healthy individuals is oftentimes different and characterizable. The ability to monitor gene expression in such cases provides medical professionals with a powerful diagnostic tool. This form of diagnosis is especially important in the area of oncology, where it is thought that the overexpression of an oncogene, or the underexpression of a tumor suppressor gene, results in tumorogenesis. See Mikkelson et al. J. Cell. Biochem. 1991, 46, 3-8.

One can indirectly monitor gene expression, for example, by measuring a nucleic acid (e.g., mRNA) that is the transcription product of a targeted gene. The nucleic acid is chemically or biochemically labeled with a detectable moiety and allowed to hybridize with a localized nucleic acid probe of known sequence. The detection of a labeled nucleic acid at the probe position indicates that the targeted gene has been expressed. See International Application Publication Nos. WO 97/27317, WO 92/10588 and WO 97/10365.

The labeling of a nucleic acid is typically performed by covalently attaching a detectable group (label) to either an internal or terminal position. Scientists have reported a number of detectable nucleotide analogues that have been enzymatically incorporated into an oligo- or polynucleotide. Langer et al., for example, disclosed analogues of dUTP and UTP that contain a covalently bound biotin moiety. Proc. Natl. Acad. Sci. USA 1981, 78, 6633-6637. The analogues, shown below, possess an allylamine linker arm that is attached to the C-5 position of the pyrimidine ring. The dUTP and UTP analogues, wherein R is H or OH, were incorporated into a polynucleotide.

Petrie et al. disclosed a dATP analogue, 3-[5-[(N-biotinyl-6-aminocaproyl)-amino]pentyl]-1-(2-deoxy-β-D-erythro-pentofuranosyl)-1 H-pyrazolo[3,4-d]pyrimidin-4-amine-5′-triphosphate. Bioconjugate Chem. 1991, 2, 441-446. The analogue, shown below, is modified at the 3-position with a linker arm that is attached to a biotin moiety. Petrie et al. reported that the compound wherein R is biotin is incorporated into DNA by nick translation.

Prober et al. disclosed a set of four dideoxynucleotides, each containing a succinylfluorescein dye. Science 1987, 238, 336-341. The dideoxynucleotides, one of which is shown below, were enzymatically incorporated into an oligonucleotide through a template directed extension of a primer. The compounds provided for a DNA sequencing method based on gel migration.

Herrlein et al. disclosed modified nucleoside trisphosphates of the four DNA bases. Helv. Chim. Acta 1994, 77, 586-596. The compounds, one of which is shown below, contain a 3′-amino group containing radioactive or fluorescent moieties. Herrlein et al. further described the use of the nucleoside analogues as DNA chain terminators.

Cech et al. disclosed 3′-amino-functionalized nucleoside triphosphates. Collect. Czech. Chem. Commun. 1996, 61, S297-S300. The compounds, one of which is shown below, contain a fluorescein attached to the 3′-position through an amino linker. Cech et al. proposed that the described functionalized nucleosides would be useful as terminators for DNA sequencing.

DISCLOSURE OF THE INVENTION

The present invention relates to nucleic acid labeling compounds. More specifically, the invention provides heterocyclic derivatives containing a detectable moiety. The invention also provides methods of making such heterocyclic derivatives. It further provides methods of attaching the heterocyclic derivatives to a nucleic acid.

The development of a novel nucleic acid labeling compound that is effectively incorporated into a nucleic acid to provide a readily detectable composition would benefit genetic analysis technologies. It would aid, for example, in the monitoring of gene expression and the detection and screening of mutations and polymorphisms. Such a compound should be suitable for enzymatic incorporation into a nucleic acid. Furthermore, the nucleic acid to which the labeling compound is attached should maintain its ability to bind to a probe, such as a complementary nucleic acid.

The present invention provides nucleic acid labeling compounds that are capable of being enzymatically incorporated into a nucleic acid. The nucleic acids to which the compounds are attached maintain their ability to bind to a complementary nucleic acid sequence.

The nucleic acid labeling compounds of the present invention are of the following structure: A—O—CH₂-T-H_(c)-L-(M)_(m)-Q wherein A is hydrogen or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; T is a template moiety; H_(c) is a heterocyclic group; L is a linker moiety; Q is a detectable moiety; and M is a connecting group, wherein m is an integer ranging from 0 to about 5.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid;

-   -   X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H,         alkyl or aryl; Y is H, N₃, F, OR₉, SR₉, or NHR₉, wherein R₉ is         H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl         or aryl; L is is amido alkyl; Q is a detectable moiety; and, M         is a connecting group, wherein m is an integer ranging from 0 to         about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or a carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —C(O)NH(CH₂)₄NH—; Q is biotin; and, M is —O(CH₂)₅NH, wherein m is 1.

In another embodiment, Y is H or OH; Z is H or OH; L is —C(O)NH(CH₂)₄NH—; Q is 5-carboxyfluorescein; and, m is 0.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is amino alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)—NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —NH(CH₂)₄NH—; Q is biotin; and, m is 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —NH(CH₂)₄NH—; Q is 5-carboxyfluorescein; and, m is 0.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is alkynyl alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C≡C(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —C≡CCH₂NH—; Q is biotin; and, m is 1.

In another embodiment, Y is H or OH; Z is H or OH; L is —C≡CCH₂NH—; Q is 5-carboxyfluorescein; and, m is 1.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is amino alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —NH(CH₂)₄NH—; Q is biotin; and, M is CO(CH₂)₅NH—, wherein m is 1.

In another embodiment, Y is H or OH; Z is H or OH; L is —NH(CH₂)₄NH—; Q is 5-carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is functionalized alkyl, alkenyl alkyl or alkynyl alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —CH═CHCH₂NH—; Q is biotin; and, m is 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —CH═CHCH₂NH—; Q is 5-carboxyfluorescein; and, m is 0.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is functionalized alkyl, alkenyl alkyl or alkynyl alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —CH═CHCH₂NH—; Q is biotin; and, m is 0.

In another embodiment, Y is H or OH; Z is H or OH; L is —CH═CHCH₂NH—; Q is 5-carboxyfluorescein; and, m is 0.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is functionalized alkyl; Q is a detectable moiety; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —(CH₂)_(n)C(O)—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or fluorescein; and, M is —NH(CH₂CH₂O)_(k)NH—, wherein, k is an integer from 1 to about 5, wherein m is 1 or 0. preferably k is 1 or 2;

In another embodiment, Y is H or OH; Z is H or OH; L is —CH₂—C(O)—; Q is a carboxyfluorescein or biotin; and M is —NH(CH₂CH₂O)_(k)NH—, wherein, k is 2 and m is 1.

In another embodiment, Y is OH; Z is OH; L is —CH₂—C(O)—; Q is biotin; and M is —NH(CH₂CH₂O)_(k)NH—, wherein, k is 2 and m is 1.

In another embodiment, L is —CH═CHCH₂NH—; Q is a carboxyfluorescein; and M is —NH(CH₂CH₂O)_(k)NH—, wherein, k is 2 and m is 1.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid;

-   -   X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H,         alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H,         alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or         aryl; L is functionalized alkyl; Q is a detectable moiety; and,         M is a connecting group, wherein m is an integer ranging from 0         to about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —(CH₂)_(n)C(O)—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or a fluorescein; and, a first M is —NH(CH₂)_(n)NH—, wherein n is an integer from about 2 to about 10, and a second M is —CO(CH₂)₅NH—, wherein m is 1 or 2.

In another embodiment, Y is H or OH; Z is H or OH; L is —(CH₂)₂C(O)—, Q is biotin or a carboxyfluorescein; and a first M is —NH(CH₂)₂NH—, and a second M is —CO(CH₂)₅NH—, wherein m is 2.

In another embodiment, Y is OH; Z is OH; L is —(CH₂)₂C(O)—, Q is a carboxyfluorescein; and, a first M is —NH(CH₂)₂NH—, and a second M is —CO(CH₂)₅NH—, wherein m is 2.

In another embodiment, Y is OH; Z is OH; L is —(CH₂)₂C(O)—, Q is or biotin; and, a first M is —NH(CH₂)₂NH—, and a second M is —CO(CH₂)₅NH—, wherein m is 2.

In one embodiment, the nucleic acid labeling compounds have the following structures:

wherein A is H or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid;

-   -   X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H,         alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H,         alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or         aryl; L is amido alkyl; Q is a detectable moiety; and, M is a         connecting group, wherein m is an integer ranging from 0 to         about 3.

In another embodiment, A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or a fluorescein; wherein m is 0, 1, or 2.

In another embodiment, Y is H or OH; Z is H or OH; L is —C(O)NH(CH₂)₄NH—; and Q is biotin or a carboxyfluorescein.

In another embodiment, Y is OH; Z is H; L is —C(O)NH(CH₂)₄NH—; Q is biotin.

In another embodiment, Y is OH; Z is H; L is —C(O)NH(CH₂)₄NH—; and Q is a carboxyfluorescein.

The present invention also provides nucleic acid derivatives produced by coupling a nucleic acid labeling compound with a nucleic acid and hybridization products comprising the nucleic acid derivatives bound to a complementary probe.

In one embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or a carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

In another embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

In another embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C≡C(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

In another embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H. alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

In another embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

In another embodiment, the nucleic acid labeling compounds used in the coupling have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The hybridization product formed from this nucleic acid derivative comprises the nucleic acid derivative bound to a complementary probe. In one embodiment, the probe is attached to a glass chip.

The present invention also provides methods of synthesizing nucleic acid derivatives by attaching a nucleic acid labeling compound to a nucleic acid. It further provides methods of detecting nucleic acids involving incubating the nucleic acid derivatives with a probe.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or a carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —C≡C(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic, acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In one embodiment, the nucleic acid labeling compounds attached to the nucleic acid have the following structures:

wherein A is H or H₄O₉P₃—; X is O; Y is H or OR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from about 1 to about 10; Q is biotin or carboxyfluorescein; and, M is —O(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

The method of nucleic acid detection using the nucleic acid derivative involves the incubation of the derivative with a probe. In one embodiment, the probe is attached to a glass chip.

In yet another embodiment, the methods involve the steps of: (a) providing at least one nucleic acid coupled to a support; (b) providing a labeled moiety capable of being coupled with a terminal transferase to said nucleic acid; (c) providing said terminal transferase; and (d) coupling said labeled moiety to said nucleic acid using said terminal transferase.

In still another embodiment, the methods involve the steps of: (a) providing at least two nucleic acids coupled to a support; (b) increasing the number of monomer units of said nucleic acids to form a common nucleic acid tail on said at least two nucleic acids; (c) providing a labeled moiety capable of recognizing said common nucleic acid tails; and (d) contacting said common nucleic acid tails and said labeled moiety.

In still yet another embodiment, the methods involve the steps of: (a) providing at least one nucleic acid coupled to a support; (b) providing a labeled moiety capable of being coupled with a ligase to said nucleic acid; (c) providing said ligase; and (d) coupling said labeled moiety to said nucleic acid using said ligase.

This invention also provides compounds of the formulas described herein.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a nonlimiting set of template moieties.

FIG. 2 shows a nonlimiting set of heterocyclic groups: 4-aminopyrazolo[3,4-d]pyrimidine, pyrazolo[3,4-d]pyrimidine, 1,3-diazole (imidazole), 1,2,4-triazine-3-one, 1,2,4-triazine-3,5-dione and 5-amino-1,2,4-triazine-3-one.

FIG. 3 shows a synthetic route to fluorescein and biotin labeled 1-(2,3-dideoxy-D-glycero-pentafuranosyl)imidazole-4-carboxamide nucleotides.

FIG. 4 shows a synthetic route to C3-labeled 4-aminopyrazolo[3,4-d]pyrimidine β-D-ribofuranoside triphosphates.

FIG. 5 shows a synthetic route to fluorescein and biotin labeled N6-dideoxy-pyrazolo[3,4-d]pyrimidine nucleotides.

FIG. 6 shows a synthetic route to N4-labeled 1,2,4-triazine-3-one β-D-ribofuranoside triphosphates.

FIG. 7 shows a synthetic route to biotin and fluorescein C5-labeled 1,2,4-triazine-3,5-dione riboside triphosphates.

FIG. 8 shows a synthetic route to biotin and fluorescein C5-labeled 5-amino-1,2,4-triazine-3-one riboside triphosphates.

FIG. 9 shows graphical comparisons of observed hybridization fluorescence intensities using Fluorescein-ddlTP and Fluorescein-ddATP.

FIG. 10 shows a graphical comparison of observed hybridization fluorescence intensities using Biotin-(M)₂-ddAPTP (wherein M=aminocaproyl) and Biotin-N6-ddATP.

FIG. 11 shows graphical comparisons of observed hybridization fluorescence intensities using Biotin-M-ddITP (wherein M=aminocaproyl) and Biotin-N6-ddATP.

FIG. 12 shows a graphical comparison of overall re-sequencing (base-calling) accuracy using Fluorescein-ddITP and Fluorescein-N6-ddATP labeled targets.

FIG. 13 shows a graphical comparison of overall re-sequencing accuracy using Biotin-M-ddITP (wherein M=aminocaproyl) and Biotin-N6-ddATP.

FIG. 14 shows a graphical comparison of re-sequencing accuracy using Biotin-(M)₂-ddAPPTP (wherein M=aminocaproyl) and Biotin-N6-ddATP.

FIG. 15 shows a schematic for the preparation of N1-labeled 3-(β-D-ribofuranosyl)-1H-pyrazalo-[4,3-d]pyrimidine 5′-triphosphate.

FIG. 16 shows a schematic for the preparation of N1-labeled 5-(β-D-ribofuranosyl)-2,4[1H,3H]-pyrimidinedione 5′-triphosphate.

FIG. 17 shows a schematic for the preparation of N-labeled 2,5-anhydro-3-deoxy-D-ribo-hexamide 6-triphosphate.

FIG. 18 shows various labeling reagents suitable for use in the methods disclosed herein. FIG. 18 a shows various labeling reagents. FIG. 18 b shows still other labeling reagents. FIG. 18 c shows non-ribose or non-2′-deoxy-ribose-containing labels. FIG. 18 d shows sugar-modified nucleotide analogue labels 18d.

FIG. 19 shows HIV array data for analog 42a (T7 labeling of RNA target).

FIG. 20 shows HPLC incorporation efficiency of C-nucleotide 42a (T7 RNA pol, 1 kb transcript).

DEFINITIONS

“Alkyl” refers to a straight chain, branched or cyclic chemical group containing only carbon and hydrogen. Alkyl groups include, without limitation, ethyl, propyl, butyl, pentyl, cyclopentyl and 2-methylbutyl. Alkyl groups are unsubstituted or substituted with 1 or more substituents (e.g., halogen, alkoxy, amino).

“Aryl” refers to a monovalent, unsaturated aromatic carbocyclic group. Aryl groups include, without limitation, phenyl, naphthyl, anthryl and biphenyl. Aryl groups are unsubstituted or substituted with 1 or more substituents (e.g. halogen, alkoxy, amino).

“Amido alkyl” refers to a chemical group having the structure —C(O)NR₃R₄—, wherein R₃ is hydrogen, alkyl or aryl, and R₄ is alkyl or aryl. Preferably, the amido alkyl group is of the structure —C(O)NH(CH₂)_(n)R₅—, wherein n is an integer ranging from about 2 to about 10, and R₅ is O, NR₆, or C(O), and wherein R₆ is hydrogen, alkyl or aryl. More preferably, the amido alkyl group is of the structure —C(O)NH(CH₂)_(n)N(H)—, wherein n is an integer ranging from about 2 to about 6. Most preferably, the amido alkyl group is of the structure —C(O)NH(CH₂)₄N(H)—.

“Alkynyl alkyl” refers to a chemical group having the structure —C≡C—R₄—, wherein R₄ is alkyl or aryl. Preferably, the alkynyl alkyl group is of the structure —C≡C—(CH₂)_(n)R₅—, wherein n is an integer ranging from 1 to about 10, and R₅ is O, NR₆ or C(O), wherein R₆ is hydrogen, alkyl or aryl. More preferably, the alkynyl alkyl group is of the structure —C≡C—(CH₂)_(n)N(H)—, wherein n is an integer ranging from 1 to about 4. Most preferably, the alkynyl alkyl group is of the structure —C≡C—CH₂N(H)—.

“Alkenyl alkyl” refers to a chemical group having the structure —CH═CH—R₄—, wherein R₄ is alkyl or aryl. Preferably, the alkenyl alkyl group is of the structure —CH═CH—(CH₂)_(n)R₅—, wherein n is an integer ranging from 1 to about 10, and R₅ is O, NR₆ or C(O), wherein R₆ is hydrogen, alkyl or aryl. More preferably, the alkenyl alkyl group is of the structure —CH═CH—(CH₂)_(n)N(H)—, wherein n is an integer ranging from 1 to about 4. Most preferably, the alkenyl alkyl group is of the structure —CH═CH—CH₂N(H)—.

“Functionalized alkyl” refers to a chemical group of the structure —(CH₂)_(n)R₇—, wherein n is an integer ranging from 1 to about 10, and R₇ is O, S, NH or C(O). Preferably, the functionalized alkyl group is of the structure —(CH₂)_(n)C(O)—, wherein n is an integer ranging from 1 to about 4. More preferably, the functionalized alkyl group is of the structure —CH₂C(O)—.

“Alkoxy” refers to a chemical group of the structure —O(CH₂)R₈—, wherein n is an integer ranging from 2 to about 10, and R₈ is O, S, NH or C(O). Preferably, the alkoxy group is of the structure —O(CH₂)_(n)C(O)—, wherein n is an integer ranging from 2 to about 4. More preferably, the alkoxy group is of the structure —OCH₂CH₂C(O)—.

“Thio” refers to a chemical group of the structure —S(CH₂)_(n)R₈—, wherein n is an integer ranging from 2 to about 10, and R₈ is O, S, NH or C(O). Preferably, the thio group is of the structure —S(CH₂)_(n)C(O)—, wherein n is an integer ranging from 2 to about 4. More preferably, the thio group is of the structure —SCH₂CH₂C(O)—.

“Amino alkyl” refers to a chemical group having an amino group attached to an alkyl group. Preferably an amino alkyl is of the structure —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10. More preferably it is of the structure —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 4. Most preferably, the amino alkyl group is of the structure —NH(CH₂)₄NH—.

“Nucleic acid” refers to a polymer comprising 2 or more nucleotides and includes single-, double- and triple stranded polymers. “Nucleotide” refers to both naturally occurring and non-naturally occurring compounds and comprises a heterocyclic base, a sugar, and a linking group, preferably a phosphate ester. For example, structural groups may be added to the ribosyl or deoxyribosyl unit of the nucleotide, such as a methyl or allyl group at the 2′-O position or a fluoro group that substitutes for the 2′-O group. The linking group, such as a phosphodiester, of the nucleic acid may be substituted or modified, for example with methyl phosphonates or O-methyl phosphates. Bases and sugars can also be modified, as is known in the art. “Nucleic acid,” for the purposes of this disclosure, also includes “peptide nucleic acids” in which native or modified nucleic acid bases are attached to a polyamide backbone.

The phrase “coupled to a support” means bound directly or indirectly thereto including attachment by covalent binding, hydrogen bonding, ionic interaction, hydrophobic interaction, or otherwise.

“Probe” refers to a nucleic acid that can be used to detect, by hybridization, a target nucleic acid. Preferably, the probe is complementary to the target nucleic acid along the entire length of the probe, but hybridization can occur in the presence of one or more base mismatches between probe and target.

“Perfect match probe” refers to a probe that has a sequence that is perfectly complementary to a particular target sequence. The test probe is typically perfectly complementary to a portion (subsequence) of the target sequence. The perfect match (PM) probe can be a “test probe”, a “normalization control” probe, an expression level control probe and the like. A perfect match control or perfect match probe is, however, distinguished from a “mismatch control” or “mismatch probe.” In the case of expression monitoring arrays, perfect match probes are typically preselected (designed) to be complementary to particular sequences or subsequences of target nucleic acids (e.g., particular genes). In contrast, in generic difference screening arrays, the particular target sequences are typically unknown. In the latter case, prefect match probes cannot be preselected. The term perfect match probe in this context is to distinguish that probe from a corresponding “mismatch control” that differs from the perfect match in one or more particular preselected nucleotides as described below.

“Mismatch control” or “mismatch probe”, in expression monitoring arrays, refers to probes whose sequence is deliberately selected not to be perfectly complementary to a particular target sequence. For each mismatch (MM) control in a high-density array there preferably exists a corresponding perfect match (PM) probe that is perfectly complementary to the same particular target sequence. In “generic” (e.g., random, arbitrary, haphazard, etc.) arrays, since the target nucleic acid(s) are unknown perfect match and mismatch probes cannot be a priori determined, designed, or selected. In this instance, the probes are preferably provided as pairs where each pair of probes differ in one or more preselected nucleotides. Thus, while it is not known a priori which of the probes in the pair is the perfect match, it is known that when one probe specifically hybridizes to a particular target sequence, the other probe of the pair will act as a mismatch control for that target sequence. It will be appreciated that the perfect match and mismatch probes need not be provided as pairs, but may be provided as larger collections (e.g., 3, 4, 5, or more) of probes that differ from each other in particular preselected nucleotides. While the mismatch(s) may be located anywhere in the mismatch probe, terminal mismatches are less desirable as a terminal mismatch is less likely to prevent hybridization of the target sequence. In a particularly preferred embodiment, the mismatch is located at or near the center of the probe such that the mismatch is most likely to destabilize the duplex with the target sequence under the test hybridization conditions. In a particularly preferred embodiment, perfect matches differ from mismatch controls in a single centrally-located nucleotide.

“Labeled moiety” refers to a moiety capable of being detected by the various methods discussed herein or known in the art.

Nucleic Acid Labeling Compounds

The nucleic acid labeling compounds of the present invention are of the following structure: A—O—CH₂-T-H_(c)-L-(M)_(m)-Q wherein A is hydrogen or a functional group that permits the attachment of the nucleic acid labeling compound to a nucleic acid; T is a template moiety; H_(c) is a heterocyclic group; L is a linker moiety; Q is a detectable moiety; and M is an connecting group, wherein m is an integer ranging from 0 to about 5.

The group A is either hydrogen or a functional group that permits the attachment of a nucleic acid labeling compound to a nucleic acid. Nonlimiting examples of such groups include the following: monophosphate; diphosphate; triphosphate (H₄O₉P); phosphoramidite ((R₂N)(R′O)P), wherein R is linear, branched or cyclic alkyl, and R′ is a protecting group such as 2-cyanoethyl; and H-phosphonate (HP(O)O—HNR₃), wherein R is linear, branched or cyclic alkyl.

The template moiety (T) is covalently attached to a methylene group (CH₂) at one position and a heterocyclic group (H_(c)) at another position. A nonlimiting set of template moieties is shown in FIG. 1, wherein the substituents are defined as follows: X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; W is O, S or CH₂; D is O or S; and, G is O, NH or CH₂. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

The heterocyclic group (H_(c)) is a cyclic moiety containing both carbon and a heteroatom. Nonlimiting examples of heterocyclic groups contemplated by the present invention are shown in FIG. 2.: 4-aminopyrazolo[3,4-d]pyrimidine; pyrazolo[3,4-d]pyrimidine; 1,3-diazole (imidazole); 1,2,4-triazine-3-one; 1,2,4-triazine-3,5-dione; and, 5-amino-1,2,4-triazine-3-one.

The linker moiety (L) of the nucleic acid labeling compound is covalently bound to the heterocycle (H_(c)) at one terminal position. It is attached to the detectable moiety (Q) at another terminal position, either directly or through a connecting group (M). It is of a structure that is sterically and electronically suitable for incorporation into a nucleic acid. Nonlimiting examples of linker moieties include amido alkyl groups, alkynyl alkyl groups, alkenyl alkyl groups, functionalized alkyl groups, alkoxyl groups, thio groups and amino alkyl groups.

Amido alkyl groups are of the structure —C(O)NR₃R₄—, wherein R₃ is hydrogen, alkyl or aryl, and R₄ is alkyl or aryl. The amido alkyl group is preferably of the structure —C(O)NH(CH₂)_(n)R₅—, wherein n is an integer ranging from about 2 to about 10 and R₅ is O, NR₆ or C(O), and wherein R₆ is hydrogen, alkyl or aryl. More preferably, the amido alkyl group is of the structure —C(O)NH(CH₂)_(n)N(H)—, wherein n is an integer ranging from about 2 to about 6. Most preferably, the amido alkyl group is of the structure —C(O)NH(CH₂)₄N(H)—.

Alkynyl alkyl groups are of the structure —C≡C—R₄—, wherein R₄ is alkyl or aryl. The alkynyl alkyl group is preferably of the structure —C≡C(CH₂)_(n)R₅—, wherein n is an integer ranging from 1 to about 10 and R₅ is O, NR₆ or C(O), and wherein R₆ is hydrogen, alkyl or aryl. More preferably, the alkynyl alkyl group is of the structure —C≡C—(CH₂)_(n)N(H)—, wherein n is an integer ranging from 1 to about 4. Most preferably, the alkynyl alkyl group is of the structure —C≡C—CH₂N(H)—.

Alkenyl alkyl groups are of the structure —CH═CH—R₄—, wherein R₄ is alkyl or aryl. The alkenyl alkyl group is preferably of the structure —CH═CH(CH₂)_(n)R₅—, wherein n is an integer ranging from 1 to about 10, and R₅ is O, NR₆ or C(O), and wherein R₆ is hydrogen, alkyl or aryl. More preferably, the alkenyl alkyl group is of the structure —CH═CH(CH₂)_(n)NH—, wherein n is an integer ranging from 1 to about 4. Most preferably, the alkenyl alkyl group is of the structure —CH═CHCH₂NH—.

Functionalized alkyl groups are of the structure —(CH₂)_(n)R₇—, wherein n is an integer ranging from 1 to about 10, and R₇ is O, S, NH, or C(O). The functionalized alkyl group is preferably of the structure —(CH₂)C(O)—, wherein n is an integer ranging from 1 to about 4. More preferably, the functionalized alkyl group is —CH₂C(O)—.

Alkoxy groups are of the structure —O(CH₂)_(n)R₈—, wherein n is an integer ranging from 2 to about 10, and R₈ is O, S, NH, or C(O). The alkoxy group is preferably of the structure —O(CH₂)_(n)C(O)—, wherein n is an integer ranging from 2 to about 4. More preferably, the alkoxy group is of the structure —OCH₂CH₂C(O)—.

Thio groups are of the structure —S(CH₂)_(n)R₈—, wherein n is an integer ranging from 2 to about 10, and R₈ is O, S, NH, or C(O). The thio group is preferably of the structure —S(CH₂)_(n)C(O)—, wherein n is an integer ranging from 2 to about 4. More preferably, the thio group is of the structure —SCH₂CH₂C(O)—.

Amino alkyl groups comprise an amino group attached to an alkyl group. Preferably, amino alkyl groups are of the structure —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10. The amino alkyl group is more preferably of the structure —NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 4. Most preferably, the amino alkyl group is of the structure —NH(CH₂)₄NH—.

The detectable moiety (O) is a chemical group that provides an signal. The signal is detectable by any suitable means, including spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. In certain cases, the signal is detectable by 2 or more means.

The detectable moiety provides the signal either directly or indirectly. A direct signal is produced where the labeling group spontaneously emits a signal, or generates a signal upon the introduction of a suitable stimulus. Radiolabels, such as ³H, ¹²⁵I, ³⁵S, ¹⁴C or ³²P, and magnetic particles, such as Dynabeads™, are nonlimiting examples of groups that directly and spontaneously provide a signal. Labeling groups that directly provide a signal in the presence of a stimulus include the following nonlimiting examples: colloidal gold (40-80 nm diameter), which scatters green light with high efficiency; fluorescent labels, such as fluorescein, texas red, rhodamine, and green fluorescent protein (Molecular Probes, Eugene, Oreg.), which absorb and subsequently emit light; chemiluminescent or bioluminescent labels, such as luminol, lophine, acridine salts and luciferins, which are electronically excited as the result of a chemical or biological reaction and subsequently emit light; spin labels, such as vanadium, copper, iron, manganese and nitroxide free radicals, which are detected by electron spin resonance (ESR) spectroscopy; dyes, such as quinoline dyes, triarylmethane dyes and acridine dyes, which absorb specific wavelengths of light; and colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. See U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149 and 4,366,241.

A detectable moiety provides an indirect signal where it interacts with a second compound that spontaneously emits a signal, or generates a signal upon the introduction of a suitable stimulus. Biotin, for example, produces a signal by forming a conjugate with streptavidin, which is then detected. See Hybridization With Nucleic Acid Probes. In Laboratory Techniques in Biochemistry and Molecular Biology; Tijssen, P., Ed.; Elsevier: New York, 1993; Vol. 24. An enzyme, such as horseradish peroxidase or alkaline phosphatase, that is attached to an antibody in a label-antibody-antibody as in an ELISA assay, also produces an indirect signal.

A preferred detectable moiety is a fluorescent group. Flourescent groups typically produce a high signal to noise ratio, thereby providing increased resolution and sensitivity in a detection procedure. Preferably, the fluorescent group absorbs light with a wavelength above about 300 nm, more preferably above about 350 nm, and most preferably above about 400 nm. The wavelength of the light emitted by the fluorescent group is preferably above about 310 nm, more preferably above about 360 nm, and most preferably above about 410 nm.

The fluorescent detectable moiety is selected from a variety of structural classes, including the following nonlimiting examples: 1- and 2-aminonaphthalene, p,p′diaminostilbenes, pyrenes, quaternary phenanthridine salts, 9-aminoacridines, p,p′-diaminobenzophenone imines, anthracenes, oxacarbocyanine, marocyanine, 3-aminoequilenin, perylene, bisbenzoxazole, bis-p-oxazolyl benzene, 1,2-benzophenazin, retinol, bis-3-aminopridinium salts, hellebrigenin, tetracycline, sterophenol, benzimidazolyl phenylamine, 2-oxo-3-chromen, indole, xanthen, 7-hydroxycoumarin, phenoxazine, salicylate, strophanthidin, porphyrins, triarylmethanes, flavin, xanthene dyes (e.g., fluorescein and rhodamine dyes); cyanine dyes; 4,4-difluoro-4-bora-3a,4a-diaza-s-indacene dyes and fluorescent proteins (e.g., green fluorescent protein, phycobiliprotein).

A number of fluorescent compounds are suitable for incorporation into the present invention. Nonlimiting examples of such compounds include the following: dansyl chloride; fluoresceins, such as 3,6-dihydroxy-9-phenylxanthhydrol; rhodamineisothiocyanate; N-phenyl-1-amino-8-sulfonatonaphthalene; N-phenyl-2-amino-6-sulfonatonaphthanlene; 4-acetamido-4-isothiocyanatostilbene-2,2′-disulfonic acid; pyrene-3-sulfonic acid, 2-toluidinonapththalene-6-sulfonate; N-phenyl, N-methyl 2-aminonaphthalene-6-sulfonate; ethidium bromide; stebrine; auromine-0,2-9′-anthroyl)palmitate; dansyl phosphatidylethanolamin; N,N′-dioctadecyl oxacarbocycanine; N,N′-dihexyl oxacarbocyanine; merocyanine, 4-(3′-pyrenyl)butryate; d-3-aminodesoxy-equilenin; 12-(9′-anthroyl)stearate; 2-methylanthracene; 9-vinylanthracene; 2,2′-(vinylene-p-phenylene)bisbenzoxazole; p-bis[2-(4-methyl-5-phenyl oxazolyl)]benzene; 6-dimethylamino-1,2-benzophenzin; retinol; bis(3′-aminopyridinium)-1,10-decandiyl diiodide; sulfonaphthylhydrazone of hellibrienin; chlorotetracycline; N-(7-dimethylamino-4-methyl-2-oxo-3-chromenyl)maleimide; N-[p-(2-benzimidazolyl)phenyl]maleimide; N-(4-fluoranthyl)maleimide; bis(homovanillic acid); resazarin; 4-chloro-7-nitro-2,1,3-benzooxadizole; merocyanine 540; resorufin; rose bengal and 2,4-diphenyl-3 (2H)-furanone. Preferably, the fluorescent detectable moiety is a fluorescein or rhodamine dye.

Another preferred detectable moiety is colloidal gold. The colloidal gold particle is typically 40 to 80 nm in diameter. The colloidal gold may be attached to a labeling compound in a variety of ways. In one embodiment, the linker moiety of the nucleic acid labeling compound terminates in a thiol group (—SH), and the thiol group is directly bound to colloidal gold through a dative bond. See Mirkin et al. Nature 1996, 382, 607-609. In another embodiment, it is attached indirectly, for instance through the interaction between colloidal gold conjugates of antibiotin and a biotinylated labeling compound. The detection of the gold labeled compound may be enhanced through the use of a silver enhancement method. See Danscher et al. J Histotech 1993, 16, 201-207.

The connecting groups (M)_(m) may serve to covalently attach the linker group (L) to the detectable moiety (Q). Each M group can be the same or different and can independently be any suitable structure that will not interfere with the function of the labeling compound. Nonlimiting examples of M groups include the following: amino alkyl, —CO(CH₂)₅NH—, —CO—, —CO(O)—, —CO(NH)—, —CO(CH₂)₅NHCO(CH₂)₅NH—, —NH(CH₂CH₂O)_(k)NH—, and —CO(CH₂)₅—; wherein, k is an integer from 1 to about 5, preferably k is 1 or 2; m is an integer ranging from 0 to about 5, preferably 0 to about 3.

In one embodiment, the nucleic acid labeling compounds of the present invention are of the following structure:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; H_(c) is a heterocyclic group; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In one embodiment, the heterocyclic group (H_(c)) is an imidazole, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is an imidazole and the linking moiety is amido alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; R₃ is hydrogen or alkyl; R₄ is —(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about 10; Q is biotin or carboxyfluorescein; A is hydrogen or H₄O₉P₃—; and, M is —CO(CH₂)₅NH— or —CO—, wherein m is 1 or 0. More preferably, Y and Z are hydrogen; R₃ is hydrogen; R₄ is —(CH₂)₄NH—; A is H₄O₉P₃—; and, Q is biotin, wherein M is —CO(CH₂)₅NH— and m is 1, or 5- or 6-carboxyfluorescein, wherein m is 0.

In another embodiment, the heterocyclic group (H_(c)) is a C3 substituted 4-amino-pyrazolo[3,4-d]pyrimidine, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is a C3 substituted 4-aminopyrazolo[3,4-d]pyrimidine and the linking group is an alkynyl alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; n is an integer ranging from 1 to about 10; R₅ is O or NH; A is hydrogen or H₄O₉P₃—; Q is biotin or carboxyfluorescein; M is —CO(CH₂)₅NH—, wherein m is 1 or 0. More preferably, Y and Z are OH; n is 1; R₅ is NH; A is H₄O₉P₃—; and, Q is biotin or 5- or 6-carboxyfluorescein, wherein m is 1.

In another embodiment, the heterocyclic group (H_(c)) is an C4 substituted pyrazolo[3,4-d]pyrimidine, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is an N4 substituted 4-amino-pyrazolo[3,4-d]pyrimidine and the linking group is an amino alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; n is an integer ranging from about 2 to about 10; A is hydrogen or H₄O₉P₃—; Q is biotin or carboxyfluorescein; M is —O(CH₂)₅NH— or —O(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0. More preferably, Y and Z are hydrogen; n is 4; A is H₄O₉P₃—; Q is biotin or 5- or 6-carboxyfluorescein, wherein m is 0.

In another embodiment, the heterocyclic group (H_(c)) is a 1,2,4-triazine-3-one, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is a 1,2,4-triazine-3-one and the linking group is amino alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; n is an integer ranging from about 2 to about 10; A is hydrogen or H₄O₉P₃—; Q is biotin or carboxyfluorescein; M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0. More preferably, Y and Z are hydroxyl; n is 4; A is H₄O₉P₃—; Q is biotin or 5- or 6-carboxyfluorescein, wherein M is —CO(CH₂)₅NH—, and m is 1.

In another embodiment, the heterocyclic group (H_(c)) is a 1,2,4-triazine-3,5-dione, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁, or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, F or OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is a 1,2,4-triazine-3,5-dione and the linking group is alkenyl alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; n is an integer ranging from about 1 to about 10; R₅ is NR₆, wherein R₆ is hydrogen, alkyl or aryl; A is hydrogen or H₄O₉P₃—; Q is biotin or carboxyfluorescein; M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In another embodiment, the heterocyclic group (H_(c)) is a 5-amino-1,2,4-triazine-3-one, and the nucleic acid labeling compounds have the following structures:

wherein L is a linker moiety; Q is a detectable moiety; X is O, S, NR₁ or CHR₂; Y is H, N₃, F, OR₉, SR₉ or NHR₉; Z is H, N₃, For OR₁₀; A is H or a functional group that permits the attachment of the nucleic acid label to a nucleic acid; and, M is a connecting group, wherein m is an integer ranging from 0 to about 3. The substituents R₁, R₂, R₉ and R₁₀ are independent of one another and are H, alkyl or aryl.

In a preferred embodiment, the heterocyclic group (H_(c)) is a 5-amino-1,2,4-triazine-3-one and the linking group is alkenyl alkyl:

wherein Y is hydrogen or hydroxyl; Z is hydrogen or hydroxyl; n is an integer ranging from about 1 to about 10; R₅ is NR₆, wherein R₆ is hydrogen, alkyl or aryl; A is hydrogen or H₄O₉P₃—; Q is biotin or carboxyfluorescein; M is —CO(CH₂)₅NH— or —CO(CH₂)₅NHCO(CH₂)₅NH—, wherein m is 1 or 0.

In a preferred embodiment, the nucleic acid labeling compounds have the formulas:

wherein Q is biotin or a carboxyfluorescein.

In another embodiment, the nucleic acid labeling compounds have the formulas:

wherein R₁₁ is hydrogen, hydroxyl, a phosphate linkage, or a phosphate group; R₁₂ is hydrogen or hydroxyl; R₁₃ is hydrogen, hydroxyl, a phosphate linkage, or a phosphate group; and R₁₄ is a coupled labeled moiety. Synthesis of Nucleic Acid Labeling Compounds

FIG. 3 shows a synthetic route to nucleic acid labeling compounds 8a and 8b, in which the heterocyclic group (H_(c)) is an imidazole and the linker moiety (L) is an amido alkyl. The silyl protected imidazole (2) was added to pentofuranose (1) to provide a mixture of carboethoxyimidazole dideoxyriboside isomers (3a-3d). The isomers were separated to afford purified 3c. The carboethoxy group of 3c was converted into an amino carboxamide (4) upon treatment with a diamine. The terminal amine of 4 was protected to give the trifluoroacetylated product 5. The silyl protecting group of 5 was removed, providing the primary alcohol 6. Compound 6 was converted into a 5′-triphosphate to afford 7. The trifluoroacetyl protecting group of 7 was removed, and the deprotected amine was reacted with biotin —NH(CH₂)₅CO—NHS or 5-carboxyfluorescein-NHS giving, respectively, nucleic acid labeling compounds 8a and 8b.

FIG. 4 shows a synthetic route to C3-labeled 4-aminopyrazolo[3,4-d]pyrimidine β-D-ribofuranoside triphosphates. A protected propargylamine linker was added to nucleoside (9) under palladium catalysis to provide the coupled product (10). The primary alcohol of the alkyne substituted nucleoside (10) was phosphorylated, yielding the 5′-triphosphate 11. The protected amine of triphosphate 11 was then deprotected, and the resulting primary amine was treated with a reactive biotin or fluorescein derivative to afford, respectively, nucleic acid labeling compounds 12a and 12b.

FIG. 5 shows a synthetic route to pyrazolopyrimidine nucleotides. A chloropyrazolopyrimidine (13) was added to pentofuranose 1 to provide adduct 14 as a mixture of anomers. A diamine was added to compound 14, affording a mixture of primary amines (15). The primary amines (15) were protected and chromatographically separated to yield the pure β-anomer 16. The silyl group of 16 was removed and the resulting primary alcohol was phosphorylated to provide triphosphate 17. The trifluoroacetyl group of 17 was removed and the deprotected amine was treated with a reactive biotin or carboxyfluorescein derivative giving, respectively, nucleic acid labeling compounds 18a-18d.

FIG. 6 shows a synthetic route to N4-labeled 1,2,4-triazine-3-one β-D-ribofuranoside triphosphates. 1,2,4-Triazine-3,5-dione ribonucleoside 19 was converted into the triazole nucleoside 20 upon treatment with triazole and phosphorous trichloride. Addition of a diamine to 20 provided aminoalkyl nucleoside 21. The primary amine of 21 was protected, affording trifluoroacetamide 22. The primary alcohol of 22 was phosphorylated, and the protected amine was deprotected and reacted with a reactive biotin or carboxyfluorescein derivative, giving, respectively, nucleic acid labeling compounds 23a and 23b.

FIG. 7 shows a synthetic route to C5-labeled 1,2,4-triazine-3,5-dione riboside phosphates. Aldehyde 24 is reacted with ylide 25 to provide the phthalimide protected allylamine 26. Compound 26 is coupled with pentofuranoside 27, yielding nucleoside 28. The phthalimide group of 28 is removed upon treatment with hydrazine to afford primary amine 29. Amine 29 is protected as amide 30. Amide 30 is phosphorylated, deprotected and treated with a reactive derivative of biotin or carboxyfluorescein, giving, respectively, nucleic acid labeling compounds 31a and 31b.

FIG. 8 shows a synthetic route to C5-labeled 5-amino-1,2,4-triazine-3-one riboside triphosphates. Compound 28 is converted into the amino-1,3-6-triazine compound 32 upon treatment with a chlorinating agent and ammonia. The phthalimide group of 32 is removed upon treatment with hydrazine, and the resulting primary amine is protected to provide 33. Compound 33 is phosphorylated, deprotected and treated with a reactive derivative of biotin or carboxyfluorescein, giving, respectively, nucleic acid labeling compounds 34a and 34b.

Nucleic Acid Labeling

Nucleic acids can be isolated from a biological sample or synthesized, on a solid support or in solution for example, according to methods known to those of skill in the art. As used herein, there is no limitation on the length or source of the nucleic acid used in a labeling process. Exemplary methods of nucleic acid isolation and purification are described in Theory and Nucleic Acid Preparation. In Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes; P. Tijssen, Ed.; Part I; Elsevier: N.Y., 1993. A preferred method of isolation involves an acid guanidinium-phenol-chloroform extraction followed by oligo dT column chromotography or (dT)n magnetic bead use. Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd ed.; Cold Spring Harbor Laboratory, 1989; Vols. 1-3; and Current Protocols in Molecular Biology; F. Ausubel et al. Eds.; Greene Publishing and Wiley Interscience: N.Y., 1987.

In certain cases, the nucleic acids are increased in quantity through amplification. Suitable amplification methods include, but are not limited to, the following examples: polymerase chain reaction (PCR) (Innis, et al. PCR Protocols. A guide to Methods and Application; Academic Press: San Diego, 1990); ligase chain reaction (LCR) (Wu and Wallace. Genomics 1989, 4, 560; Landgren, et al. Science 1988, 241, 1077; and Barringer, et al. Gene 1990, 89, 117); transcription amplification (Kwoh et al. Proc. Natl. Acad. Sci. USA 1989, 86, 1173); and self-sustained sequence replication (Guatelli, et al. Proc. Nat. Acad. Sci. USA 1990, 87, 1874).

The nucleic acid labeling compound can be incorporated into a nucleic acid using a number of methods. For example, it can be directly attached to an original nucleic acid sample (e.g., mRNA, polyA mRNA, cDNA) or to an amplification product. Methods of attaching a labeling compound to a nucleic acid include, without limitation, nick translation, 3-end-labeling, ligation, in vitro transcription (IVT) or random priming. Where the nucleic acid is an RNA, a labeled riboligonucleotide is ligated, for example, using an RNA ligase such as T4 RNA Ligase. In The Enzymes; Uhlenbeck and Greensport, Eds.; Vol. XV, Part B, pp. 31-58; and, Sambrook et al., pp. 5.66-5.69. Terminal transferase is used to add deoxy-, dideoxy- or ribonucleoside triphosphates (dNTPs, ddNTPs or NTPs), for example, where the nucleic acid is single stranded DNA.

The labeling compound can also be incorporated at an internal position of a nucleic acid. For example, PCR in the presence of a labeling compound provides an internally labeled amplification product. See, e.g., Yu et al. Nucleic Acids Research 1994, 22, 3226-3232. Similarly, IVT in the presence of a labeling compound can provide an internally labeled nucleic acid.

Probe Hybridization

The nucleic acid to which the labeling compound is attached can be detected after hybridization with a nucleic acid probe. Alternatively, the probe can be labeled, depending upon the experimental scheme preferred by the user. The probe is a nucleic acid, or a modified nucleic acid, that is either attached to a solid support or is in solution. It is complementary in structure to the labeled nucleic acid with which it hybridizes. The solid support is of any suitable material, including polystyrene based beads and glass chips. In a preferred embodiment, the probe or target nucleic acid is attached to a glass chip, such as a GeneChip® product (Afrymetrix, Inc., Santa Clara, Calif.). See International Publication Nos. WO 97/10365, WO 97/29212, WO 97/27317, WO 95/11995, WO 90/15070, and U.S. Pat. Nos. 5,744,305 and 5,445,934 which are hereby incorporated by reference.

Because probe hybridization is often a step in the detection of a nucleic acid, the nucleic acid labeling compound must be of a structure that does not substantially interfere with that process. The steric and electronic nature of the labeling compound, therefore, is compatible with the binding of the attached nucleic acid to a complementary structure.

EXAMPLES

The following examples are offered to illustrate, but not to limit, the present invention.

General Experimental Details

Reagents were purchased from Aldrich Chemical Company (Milwaukee, Wis.) in the highest available purity. All listed solvents were anhydrous. Intermediates were characterized by ¹H NMR and mass spectrometry.

Example 1 Synthesis of Fluorescein- and Biotin-labeled 1-(2,3-dideoxy-β-D-glycero-pentafuranosyl)imidazole-4-carboxamide nucleotides.

1-O-acetyl-5-O-(t-butyldimethylsilyl)-2,3-dideoxy-D-glycero-pentafuranose 1 (9.4 g, 34.2 mmole) (see, Duelholm, K.; Penderson, E. B., Synthesis, 1992, 1) and 1-trimethylsilyl-4-carboethoxyimidazole 2 (6.3 g; 34.2 mmole) (see, Pochet, S, et. al., Bioorg. Med. Chem. Lett., 1995, 5, 1679) were combined in 100 ml dry DCM under Ar, and trimethylsilyl triflate catalyst (6.2 ml; 34.2 mmole) was added at 0° C. The solution was allowed to stir at room temperature for 5 hours and was then washed 3× with 100 ml of saturated aqueous NaHCO₃, 1× with saturated aqueous NaCl, dried with NaSO₄ and evaporated to provide 14 g of a crude mixture of four carboethoxyimidazole dideoxyriboside isomers (3a-d), corresponding to α and β-anomers of both N1 and N3 alkylation products. The isomeric products were purified and separated by flash chromatography (silica gel, EtOAc-hexane), in 52% total yield. The β-N1 isomer (2.2 g; 18% yield), was identified by ¹H-NMR chemical shift and NOE data (see, Pochet, S, et. al., Bioorg. Med. Chem. Lett., 1995, 5, 1679). Purified 3c (0.5 g; 1.4 mmole) was heated with a 20-fold excess of 1,4-diaminobutane (3.0 ml, 30 mmole) neat at 145° C. for 4 hours, and then the resulting mixture was diluted with 50 ml EtOAc, washed 3× with water, 1× with brine, and dried with NaSO₄ and evaporated to provide 500 mg (95%) of the imidazole-4-(4-aminobutyl)carboxamide dideoxyriboside 4 as a colorless oil. After coevaporation with toluene, 4 (393 mg; 0.75 mmole) was combined with trifluoroacetylimidazole (94 uL; 0.83 mmole) in 5 ml dry THF at 0° C., and stirred for 10 minutes. The solvent was evaporated, and the oily residue taken up in 50 ml EtOAc, extracted 2× with saturated aqueous NaHCO₃, 1× with saturated aqueous NaCl, dried with NaSO₄, and evaporated to yield 475 mg (99%) of the N-TFA protected nucleoside 5 as a colorless oil. The TBDMS group was removed by addition of excess triethylamine trihydrofluoride (2.3 ml; 14.4 mmole) in 20 ml dry THF and stirring overnight. The THF was evaporated in vacuo, the residue was taken up in 50 ml EtOAc and the solution was washed carefully with a 1:1 mixture of saturated aqueous NaHCO₃ and brine until neutral, then dried with NaSO₄, and evaporated to yield 340 mg (96%) of the 5 as a pale yellow oil. The NMR & MS data were consistent with the assigned structure.

Nucleoside 6 was converted to a 5′-triphosphate, deprotected, reacted with biotin —NH(CH₂)₅CO—NHS or 5-carboxyfluorescein-NHS and purified according to procedures reported elsewhere (see, Prober, J. M., et al., 1988, PCT 0 252 683 A2) to give the labeled nucleotides 8a,b in >95% purity by HPLC, ³¹P-NMR.

Example 2 Synthesis of C3-Labeled 4-aminopyrazolo[3,4-d]pyrimidine β-D-ribofuranoside triphosphates

The synthesis of 3-iodo-4-aminopyrazolo[3,4-d]pyrimidine ribofuranside (9) was carried out as described by H. B. Cottam, et al. 1993, J. Med. Chem. 36:3424. Using the appropriate deoxyfuranoside precursors, both the 2′-deoxy and 2′,3′-dideoxy nucleosides are prepared using analogous procedures. See, e.g., U. Neidballa & H. Vorbruggen 1974, J. Org. Chem. 39:3654; K. L. Duehom & E. B. Pederson 1992, Synthesis 1992:1). Alternatively, these are prepared by deoxygenation of ribofuranoside 9 according to established procedures. See, M. J. Robins et al. 1983 J. Am. Chem. Soc. 103:4059; and, C. K. Chu, et al. 1989 J. Org. Chem. 54:2217.

A protected propargylamine linker was added to the 4-aminopyrazolo[3,4d] pyrimidine nucleoside (9) via organopalladium-mediated substitution to the 3-position of 4-aminopyrazolo[3,4-4-d]pyrimidine riboside using the procedure described by Hobbs (J. Org. Chem. 54: 3420; Science 238: 336.). Copper iodide (38 mg; 0.2 mmole), triethylamine (560 uL; 4.0 mmole), N-trifluoroacetyl-3-aminopropyne (700 uL; 6.0 mmole) and 3-iodo-4-aminopyrazolo[3,4-d]pyrimidine β-D-nbofuranoside (9) (H. B. Cottam, et al., 1993, J. Med. Chem. 36: 3424.) (786 mg; 2.0 mmole) were combined in 5 ml of dry DMF under argon. To the stirring mixture was added tetrakis(triphenylphosphine) palladium(0) (232 mg; 0.2 mmole). The solution became homogeneous within 10 minutes, and was left stirring for an additional 4 hours in the dark, at which time the reaction was diluted with 20 mL of MEOH-DCM (1:1), 3.3 g of Dowex AG-1 anion exchange resin (bicarbonate form) was added, and stirring was continued for another 15 minutes. The resin was removed by filtration and washed with MEOH-DCM (1:1), and the combined filtrates were evaporated to dryness. The residue was dissolved in 4 mL of hot MeOH, then 15 mL DCM was added and the mixture kept warm to maintain a homogeneous solution while it was loaded onto a 5 cm×25 cm column of silica gel that had been packed in 1:9 MeOH-DCM. The product (R_(f)˜0.4, 6:3:1:1 DCM-EtOAc-MeOH—HOAc) was eluted with a 10-15-20% MeOH-DCM step gradient. The resulting pale yellow solid was washed 3× with 2.5 ml of ice-cold acetonitrile, then 2× with ether and dried in vacuo to obtain 630 mg (75%) of 4-amino-3-(N-trifluoroacetyl-3-aminopropynyl)pyrazolo[3,4-d]pyrimidine β-D-ribofuranoside (10). Identity of the product was confirmed by ¹H-nmr, mass spectrometry and elemental analysis.

The nucleoside was converted to a 5′-triphosphate (11), deprotected, reacted with oxysuccinimidyl-(N-biotinoyl-6-amino)hexanoate, or oxysuccinimidyl-(N-(fluorescein-5-carboxyl)-6-amino)hexanoate, and purified according to procedures reported elsewhere (Prober, J. M., et al., 1988, PCT 0 252 683 A2.) to give the biotin- and fluorescein-labeled nucleotides (12a, 12b) in >95% purity.

Example 3 Synthesis of Fluorescein- and Biotin-N6-dideoxy-pyrazalo[3,4-d]pyrimidine Nucleotides

1-O-acetyl-5-O (t-butyldimethylsilyl)-2,3-dideoxy-D-glycero-pentofuranose (1) and 1-trimethylsilyl-4-chloropyrazolo[3,4-d]pyrimidine (13) were synthesized according to literature procedures. Duelholm, K. L.; Penderson, E. B., Synthesis 1992, 1-22; and, Robins, R. K., J Amer Chem Soc. 1995, 78, 784-790. To 2.3 g (8.3 mmol) of 1 and 1.9 g (8.3 mmol, 1 eq) of 13 in 40 ml of dry DCM at 0° C. under argon was added slowly over 5 minutes 1.5 mL (8.3 mmol, 1 eq) of trimethylsilyl triflate. After 30 min. 4.2 ml (41.5 mmol, 5 eq) of 1,2-diaminobutane was added rapidly and the reaction was stirred at room temperature for 1 hr. The solvent was evaporated; the residue was dissolved in 50 ml of ethylacetate and washed with 50 ml of saturated aqueous. NaHCO₃ and dried over Na₂SO₄, filtered and the solvent evaporated to yield 4.2 g of a yellow foam. The foam was dissolved in 100 ml of diethyl ether and 100 ml of hexanes was added to precipitate the product as an oil. The solvent was decanted and the oil was dried under high vacuum to give 3.4 g of 15 as a pale yellow foam. HPLC, UV and MS data were consistent with a 2:1 mixture of the α- and β-anomers.

To the crude mixture of isomers (3.4 g, 8.1 mmol, ˜50% pure) in 140 ml of dry THF at 0° C. under argon was added slowly 1.0 ml of 1-trifluoroacetylimidazole (8.9 mmol, 1.1 eq). The reaction was followed by RP-HPLC. An additional 5% of the acylating agent was added to completely convert the starting material to mixture of TFA-protected anomers. Bergerson, R. G.; McManis, J. S J. Org. Chem 1998, 53, 3108-3111. The reaction was warmed to room temperature, and then the solvent was evaporated to about 25 ml volume and diluted with 100 ml of ethylacetate. The solution was extracted twice with 25 ml of 1% aq. NaHCO₃, once with brine, then dried over Na₂SO₄ and evaporated to afford 3.4 g of yellow foam. The crude material was purified by flash chromatography on silica gel in EtOAc-hexanes to give 1.3 g of the α-anomer and 0.7 g of the β-anomer of 16 (50% total yield). The 1H-NMR and MS data were consistent with the assigned structure and stereochemistry.

To 1.3 g (2.5 mmol) of 16 (α-anomer) in 50 ml of dry THF under argon was added 1 ml (13.6 mmol) of triethylamine and 6.1 ml (37.5 mmol, 15 eq) of triethylamine trihydrofluoride. After stirring for 16 hr., the solvent was evaporated, and residual triethylamine trihydrofluoride removed under high vacuum. Pirrung, M. C.; et al. Biorg. Med. Chem. Lelf. 1994, 4, 1345-1346. The residue was dissolved in 100 ml of ethylacetate and washed carefully with 4×100 ml of sat. aq. NaHCO₃, once with brine, then dried over Na₂SO₄ and evaporated to give 850 mg (95%) of white foam. 1H-NMR, UV and MS data were consistent with the assigned structure of the desilylated nucleoside, which was used in the next step without further purification.

The nucleoside was converted to the triphosphate using the Eckstein phosphorylation procedure (Ludwig, J. L.; Eckstein, F. J. Org. Chem. 1989, 54, 631-635) followed by HPLC purification on a ResourceQ anion exchange column (buffer A is 20 mM Tri pH 8, 20% CH₃CN and buffer B is 20 mM Tris pH 8, 1 M NaCl, 20% CH3CN). ³¹P-NMR, UV and MS data were consistent with the structure of the triphosphate. The trifluoroacetyl-protecting group was removed by treatment with excess NH₄OH at 55° C. for 1 hr. followed by evaporation to dryness. The mass spectral data were consistent with the aminobutyl nucleotide 17. Without further purification, the nucleotide was treated with either Biotin-NHS esters or 5-Carboxyfluorescein-NHS as described elsewhere (Prober, J. M., et al., 1988, PCT 0 252 683 A2) to form the labeled nucleotides 18a-18d, respectively, which were purified by HPLC as described (Prober, J. M., et al., 1988, PCT 0 252 683 A2) except that, in the case of 18a, the buffer was 20 mM sodium phosphate pH 6. The ³¹P-NMR and UW data were consistent with the structure of the labeled analogs.

Example 4 Synthesis of N4-labeled 1,2,4-triazine-3-one β-D-ribofuranoside triphosphates

To a solution of 1,2,4-triazole (6.7 g; 97 mmole) in 30 mL dry ACN was added POCl₃ (2.1 mL; 22 mmole) slowly with stirring under argon. After 30 minutes, the solution was cooled to 0° C., and a solution of triethylamine (21 mL; 150 mmole) and 2′,3′,5′-tri-O-acetyl-6-azauridine (19,4.14 g; 11 mmole (commercially available from Aldrich Chemical Company)) in 10 mL ACN was added. After stirring for an additional hour at room temperature, the resulting solution of activated nucleoside was transferred dropwise to a stirring solution of 1,4-diaminobutane (46 g; 524 mmole) in 20 mL MeOH. The solvents were removed in vacuo, and the residue was taken up in water, neutralized with acetic acid, and evaporated again to dryness. The crude residue was purified by chromatography on silica gel (95:5 MeOH—NH₄OH), followed by preparative reverse-phase HPLC to yield 150 mg (0.45 mmole; 3%) of the aminobutyl nucleoside (21). This was converted directly to the TFA-protected nucleoside (22) by reaction with 1-trifluoroacetylimidazole (3GOuL; 1.8 mmole) in 3 ml ACN at 0° C. for 2 hours, evaporating the solvent and purifying by flash chromatography (1:9 MeOH-DCM). Yield 175 mg (0.42 mmole; 93%). Identity of the product was confirmed by ¹H-nmr and mass spectrometry.

The nucleoside was converted to a 5′-triphosphate, deprotected, reacted with oxysuccinimidyl-(N-biotinoyl-6-amino)hexanoate, or oxysuccinimidyl-(N-(fluorescein-5-carboxyl)-6-amino)hexanoate, and purified according to procedures reported elsewhere (Prober, J. M., et al., 1988, PCT 0 252 683 A2.) to give the biotin- and fluorescein-labeled nucleotides (23a, 23b) in >95% purity.

Example 5 Synthesis of Biotin and Fluorescein C5-Labeled 1,2,4-Triazine-3,5-dione Riboside Triphosphates

5-Formyl-6-azauracil (24) is prepared according to literature procedures. See, Scopes, D.I.C. 1986, J. Chem. Med., 29, 809-816, and references cited therein. Compound 24 is reacted with the phosphonium ylide of 25, which is formed by treating 25 with catalytic t-butoxide, to provide the phthalimidoyl-protected allylamine 26. Protected allylamine 26 is ribosylated to provide β-anomer 28 upon reaction of 26 with β-D-pentofuranoside 27 (commercially available from Aldrich) according to the procedure of Scopes et al. 1986, J. Chem. Med., 29, 809-816. β-ribonucleoside 28 is deprotected with anhydrous hydrazine in THF to provide allylamine 29. Reaction of primary amine 29 with trifluoroacetylimidazole in THF affords the protected amine 30.

Nucleoside 30 is converted to a 5′-triphosphate, deprotected, reacted with oxysuccinimidyl-N-biotinoyl-6-amino)hexanoate or oxysuccinimidyl-(N-(fluorescein-5-carboxy)-6-amino)hexanoate and purified according to procedures reported elsewhere (Prober, J. M., et al. 1988, PCT 0 252 683 A2), giving, respectively, the biotin- and fluorescein-labeled nucleotides 31a and 31b.

Example 6 Synthesis of Biotin and Fluorescein C5-Labeled 5-Amino-1,2,4-triazine-3-one Riboside Triphosphates

β-ribonucleoside 28, described above, is treated with SOCl₂ or POCl₃ and subsequently reacted with ammonia to provide the 4-amino-1,3,6-triazine nucleoside 32. The phthalimide group of 32 is removed upon reaction with hydrazine, and the resulting primary amine is protected to afford nucleoside 33. Nucleoside 33 is converted to a 5′-triphosphate, deprotected, reacted with oxysuccinimidyl-(N-biotinoyl-6-amino)hexanoate or oxysuccinimidyl-(N-(fluorescein-5-carboxy)-6-amino)hexanoate and purified according to procedures reported elsewhere (Prober, J. M., et al. 1988, PCT 0 252 683 A2), giving, respectively, the biotin- and fluorescein-labeled nucleotides 34a and 34b.

Example 7 Procedure for HPLC Analysis of Enzymatic Incorporation of Modified Nucleotides

Reaction Conditions

-   TdT -   3 uM dT₁₆ template -   15(30) uM NTP -   40 U TdT (Promega) -   1× buffer, pH 7.5 (Promega)

Procedure: incubate 1 hr. at 37° C., then for 10 min. at 70° C., followed by the addition of EDTA (2 mM final concentration) in a volume of 50 uL

HPLC Analysis

Materials and Reagents

4.6 mm×250 mm Nucleopac PA-100 ion-exchange column (Dionex) buffer A: 20 mM NaOH (or 20 mM Tris pH 8, in the case of TdT incorporation of nucleotide triphoshates that are not dye-labeled) buffer B: 2GmM NaOH, 1M NaCl (or 20 mM Tris pH 8, 1M NaCl, in the case of TdT incorporation of nucleotide triphoshates that are not dye-labeled)

General Procedure

Dilute the reaction with 50 uL of buffer A. Inject 50 uL of this sample onto the HPLC column and fractionate using a gradient of 5 to 100% buffer B over 30 minutes at a flow rate of 1 mL/min. Detect the peaks simultaneously at 260 nm absorbance and the absorbance maximum of the dye (or the fluorescence emission maximum of the dye).

The incorporation efficiency is expressed as the fraction of oligonucleotide that is labeled. This number is determined by dividing the peak area measured at 260 nm absorbance of the labeled oligonucleotide by the sum of the peak areas of the unlabeled and labeled oligonucleotide. (The retention time of fluorescein-labeled dT₁₆ is on the order of 2 to 3 min. longer than the unlabeled dT₁₆.) The error in this type of assay is about 10%. The percentage labeling efficiency for 4 types of nucleic acid labeling compounds is shown below in Tables 1, 2 and 3.

TABLE 1 Labeling Efficiency

% Labeling Efficiency [TdT] = R B X 40 U 160 U H

—C(O)(CH₂)₅NH—Biotin 100 — H

5-carboxy-fluorescein 94 97 H

5-carboxy-fluorescein 58 98 H

trifluoroacetyl 55 — H

—C(O)(CH₂)₅NH—trifluoroacetyl 49 —

TABLE 2 Summary of TdT labeling efficiency data

% Labeling Efficiency[TdT] = X = B = R = linker and label 40 U 160 U H

R = —CO(CH₂)₅NH-biotin5-carboxyfluorescein6-carboxyfluorescein 100 9473 —9799 H

R = -biotin—CO(CH₂)₅NH-biotin—CO(CH₂)₅NHCO(CH₂)₅NH-biotin5-carboxyfluorescein 48415758 100 969498 OH

R = -biotin5-carboxyfluorescein6-carboxyfluorescein 476750 859893 OH

R = —CO(CH₂)₅NH-biotin—CO(CH₂)₅NH-fluorescein 9861 9688 H

R = 5-carboxyfluorescein 50 —

TABLE 3 Summary of TdT labeling efficiency data

% Labeling Efficiency[TdT] = X = B = R = linker and label 40 U 160 U control OH

R = 5-carboxyfluorescein 100  100  control H

R = -biotin5-carboxyfluorescein 9897 90100  analogs: H

R = -biotin—CO(CH₂)₅NH-biotin—CO(CH₂)₅NHCO(CH₂)₅NH-biotin5-carboxyfluorescein 48415758 100 969498 OH

R = -biotin5-carboxyfluorescein6-carboxyfluorescein 255337 849786 OH

R = -biotin 54 94

Example 8 Hybridization Studies of Labeled Imidazole Carboxamide (“ITP”) and 4-Aminopyrazolo[3,4-d]pyrimidine (“APPTP) Nucleotides

The performance of the labeled imidazolecarboxamide and 4-aminopyrazolo[3,4-d]pyrimidine nucleotides was evaluated in a p53 assay using standard GeneChip® product protocols (Affymetrix, Inc., Santa Clara, Calif.), which are described, for example, in detail in the GeneChip® p53 assay package insert. The sample DNA used in these experiments was the plasmid “p53mut248.” The labeled nucleotide analog was substituted for the usual labeling reagent (Fluorescein-N6-ddATP or Biotin-M-N6-ddATP (wherein M=aminocaproyl), from NEN, part #'s NEL-503 and NEL-508, respectively). Labeling reactions were carried out using both the standard amount of TdT enzyme specified in the assay protocol (25 U) and with 100 U of enzyme. After labeling, Fluorescein-labeled targets were hybridized to the arrays and scanned directly. In experiments using the biotin-labeled targets, the GeneChip® chips were stained in a post-hybridization step with a phycoerythrin-streptavidin conjugate (PE-SA), prior to scanning, according to described procedures (Science 280:1077-1082 (1998)).

FIG. 9 shows comparisons of the observed hybridization fluorescence intensities for the 1300 bases called in the “Unit-2” part of the chip. In the lower plot, intensities for the Fluorescein-ddITP (8b) labeled targets are plotted against those for the standard Fluorescein-N-6-ddATP labeled targets (control), both at 25 U of TdT. The observed slope of ˜0.75 indicates that the labeling efficiency of 8b was about 75% of that of Fluorescein-N6-ddATP under these conditions. In the upper plot, the same comparison is made, except that 100 U of TdT was used in the 8b labeling reaction. The slope of ˜1.1 indicates equivalent or slightly better labeling than the standard Fluorescein-N6-ddATP/25 U control reaction.

FIG. 10 shows comparisons of the observed hybridization fluorescence intensities for the 1300 bases called in the “Unit-2” part of the chip. Intensities for the Biotin-(M)₂-ddAPPTP (18c, M=aminocaproyl linker, referred to as Biotin-N-4-dPPTP in FIG. 10) labeled targets (after PE-SA staining) are plotted against those for the standard Biotin-M-N6-ddATP labeled targets (control), both at 25 U of TdT. The observed slope of ˜0.3 indicates that the labeling efficiency withBiotin-(M)₂-ddPTP (18c) was about 30% of that of Biotin-M-N6-ddATP under these conditions.

FIG. 11 shows comparisons of the observed hybridization fluorescence intensities for the 1300 bases called in the “Unit-2” part of the chip. In the lower plot, intensities for the Biotin-M-ddITP (8a, M=aminocaproyl; referred to as Bio-ddITP in FIG. 11) labeled targets are plotted against those for the standard Biotin-M-N6-ddATP labeled control targets, both at 25 U of TdT. The observed slope of ˜0.4 indicates that the labeling efficiency with 8a was about 40% of that of Biotin-M-N6-ddATP under these conditions. In the upper plot, the same comparison is made, except that 100 U of TdT was used in the 8a labeling reaction. The slope of ˜1.1 indicates equivalent or slightly better labeling than the standard Biotin-M-N6-ddATP/25 U control reaction.

FIG. 12 shows a comparison of the overall re-sequencing (base-calling) accuracy, for both strands, obtained using Fluorescein-ddITP labeled targets at both 25 U and 100 U of TdT, as well as the standard Fluorescein-N6-ddATP/25 U TdT labeled “control” targets. FIG. 13 shows a similar comparison for the targets labeled with biotin-M-ddITP (8a; referred to as Biotin-ddITP in FIG. 13) and biotin-M-N6-ddATP “control,” followed by PE-SA staining. FIG. 14 shows a comparison of re-sequencing accuracy using Biotin-(M)₂-ddAPPTP/100 U TdT and Biotin-M-N6ddATP/25 U TdT. These data indicate that labeled imidazolecarboxamide and 4-aminopyrazolo[3,4-d]pyrimidine dideoxynucleotide analogs can be used for DNA target labeling in hybridization-based assays and give equivalent performance to the standard labeled-N6-ddATP reagent.

Example 9

The performance of the biotin-labeled imidazolecarboxamide and 4-aminopyrazolo[3,4-d]pyrimidine nucleotides (“biotin-M-ITP” (8a) and “biotin-(M)_((M)) ₂-APPTP” (18c)) was evaluated using a single-nucleotide polymorphism genotyping GeneChip® chip array. Published protocols (D. G. Wang, et al., 1998, Science 280: 1077-82.) were used in these experiments, except for the following variations: 1) labeling reactions were carried out using both the standard amount of TdT enzyme specified in the published protocol (15U), or three-fold (45 U) enzyme; 2) substitution of the labeled nucleotide analog for the standard labeling reagent (Biotin-N6-ddATP, from NEN: P/N NEL-508); 3) the labeled nucleotide analog was used at either twice the standard concentration specified in the published protocol (25 uM), or at six-fold (75 uM). After labeling, biotin-labeled targets were hybridized to the arrays, stained with a phycoerythrin-streptavidin conjugate (PE-SA), and the array was scanned and analyzed according to the published procedure.

The data is shown in the Table 4 below. As indicated by the mean intensities of the observed hybridization signal (averaged over the entire array), labeling efficiency with biotin-M-ITP (8a) at 25 uM was as good as Biotin-N6-ddATPat 12.5 uM, and even higher intensity was gained by using 8a at 75 uM (entries 1-3; 7,8). Compared with the control, this analog provided equivalent or better assay performance, expressed as the ratio of correct base calls. Somewhat lower mean signal intensities are observed with biotin-(M)₂-APPTP (18c), reflecting the lower incorporation efficiency of this analog, but equivalent assay performance could still be achieved with this analog, using somewhat higher enzyme and nucleotide concentrations (entries 3-6).

TABLE 4 Comparison of Polymorphism Chip Data Correct Units Mean Base Call Entry Sample Nucleotide [Nucleotide] TdT Intensity Ratio 1 A Biotin-M- 75 15 164 0.98 ddIcTP (8a) 2 A Biotin-M- 75 45 235 0.98 ddIcTP (8a) 3 B Biotin-N6-M- 12.5 15 138 0.95 control ddATP (NEL 508) 4 B Biotin-N4-(M)₂- 25 15 37 0.88 ddAppTP (18c) 5 B Biotin-N4-(M)₂- 75 15 35 0.92 ddAppTP (18c) 6 B Biotin-N4-(M)₂- 75 45 87 0.95 ddAppTP (18c) 7 B Biotin-M- 25 15 116 0.95 ddIcTP (8a) 8 B Biotin-M- 75 15 149 0.95 ddIcTP (8a)

Example 10

High-density DNA probe arrays are proving to be a valuable tool for hybridization-based genetic analysis. These assays require covalent labeling of nucleic acid molecules with fluorescent or otherwise detectable molecules in order to detect hybridization to the arrays. We have pursued a program to develop a set of novel nucleotide analogs for enzymatic labeling of nucleic acid targets for a variety of array-based assays. Our primary goal was to provide new reagents for two particular labeling procedures: (i.), 3′ end labeling of fragmented, PCR-generated DNA targets with terminal deoxynucleotidyl transferase (TdT); and (ii.), template-directed internal labeling of in vitro transcription-generated RNA targets with T7 RNA polymerase (T7).

The general approach taken was to screen various base-substituted nucleotide analogs, using a rapid and quantitative HPLC-based assay, to empirically determine which analogs were efficient substrates for the polymerase of interest. The analogs selected for this study were nucleotides in which the native heterocyclic base was substituted with the following: 1-(imidazole-4-carboxamide), 1-(1,3,6-trazine-2,4-dione), 5-(1,3-pyrimidine-2,4-dione), 3-(pyrazalo-[4,3-d]pyrimidine), 1-(pyrazalo-[3,4-d]pyrimidine) and a simple carboxamide moiety. Labeled versions of promising candidate molecules were then designed and synthesized for further testing of relative incoproation efficiency and functional performance in array-based assays.

It was determined that TdT was generally tolerant of base substitutions, and that ribonucleotides were about as efficiently incorporated as 2′-deoxy, and 2′,3′-dideoxynucleotides. In contrast, T7 was relatively intolerant of heterocyclic base substitutions with the exception of the 5-(1,3-pyrimidine-2,4-dione), i.e. the pseudo-uridine analog. Two new reagents, a C4-labeled 1-(2′,3′-didexoy-β-D-ribofuranosyl) imidazole-4 carboxamide 5′-triphophate and an N1-labeled pseudo-uridine 5′-triphophate, were found to be excellent substrates for TdT and T7, respectively. These new analogs prove array assay performance equivalent to that obtained using conventional labeling reagents.

Example 11 Synthesis of Fluorescent Triphosphate Labels

To 0.5 pmoles (50,&L of a 10 mM solution) of the amino-derivatized nucleotide triphosphate, 3′amino-3′deoxythymidinetriphosphate (1) or 2′-amino-2′-deoxyuridine triphosphate (2), in a 0.5 ml ependorf tube was added 25 μL of 11 M aqueous solution of sodium borate, pH 7, 87 μL of methanol, and 88 μL (10 pmol, 20 wquiv) of a 100 mM solution of 5-carboxyfluorescein-X—NHS ester in methanol. The mixture was vortexed briefly and allowed to stand at room temperature in the dark for 15 hours. The sample was then purified by ion-exchange HPLC to afford the fluoresceinated derivatives Formula 3 or Formula 4, below, in about 78-84% yield.

Experiments suggest that these molecules are not substrates for terminal transferase (TdT). It is believed, however, that these molecules would be sutstrates for a polymerase, such as klenow fragment.

Example 12 Synthesis of as-Triazine-3,5[2H,4H]-diones

The analogs as-triazine-3,5[2H,4H]-dione (“6-aza-pyrimidine”) nucleotides (see, FIG. 23 a) are synthesized by methods similar to those used by Petrie, et al., Bioconj. Chem. 2: 441 (1991).

Other useful labeling reagents are sythesized including 5-bromo-U/dUTO or ddUTP. See for example Lopez-Canovas, L. Et al., Arch. Med. Res 25: 189-192 (1994); Li, X., et al., Cytometry 20: 172-180 (1995); Boultwood, J. Et al., J Pathol. 148: 61 ff. (1986); Traincard, et al., Ann. Immunol. 1340: 399-405 (1983); and FIGS. 23 a, and 23 b set forth herein.

Details of the synthesis of nucleoside analogs corresponding to all of the above structures (in particular those of FIG. 23 b) have been described in the literature Known procedcures can be applied in order to attach a linker to the base. The linker modified nucleosides can then be converted to a triphosphate amine for final attachment of the dye or hapten which can be carried out using commercially available activated derivatives.

Other suitable labels include non-ribose or non-2′-deoxyribose-containing structures some of which are illustrated in FIG. 23 c and sugar-modified nucleotide analogues such as are illustrated in FIG. 23 d.

Using the guidance provided herein, the methods for the synthesis of reagents and methods (enzymatic or otherwise) of label incorporation useful in practicing the invention will be apparent to those skilled in the art. See, for example, Chemistry of Nucleosides and Nucleotides 3, Townsend, L. B. ed., Plenum Press, New York, at chpt. 4, Gordon, S. The Synthesis and Chemistry of Imidazole and Benzamidizole Nucleosides and Nucleotides (1994); Gen Chem. Chemistry of Nucleosides and Nucleotides 3, Townsend, L. B. ed., Plenum Press, New York (1994); can be made by methods simliar to those set forth in Chemistry of Nucleosides and Nucleotides 3, Townsend, L. B. ed., Plenum Press, New York, at chpt. 4, Gordon, S. “The Synthesis and Chemistry of Imidazole and Benzamidizole Nucleosides and Nucleotides (1994); Lopez-Canovas, L. Et al., Arch. Med. Res 25: 189-192 (1994); Li, X., et al., Cytometry 20: 172-180 (1995); Boultwood, J. Et al., J Pathol. 148: 61 ff. (1986); Traincard, et al., Ann. Immunol. 1340: 399-405 (1983).

Example 13 Synthesis of N1-labeled 5-(β-D-ribofuranosyl)-2,4(1H,3H)-pyrimidinedione 5′-triphosphate 42a and 42b (FIG. 16)

To 5-(β-D-ribofuranosyl)-2,4(1H,3H)-pyrimidinedione 39 (100 mg, 0.41 mmol, 1 eq.) in acetonitrile (5 ml) was added 1 M TEAB, pH 9 (5 ml) followed by methyl acrylate (5.5 ml, 61 mmol, 150 eq). The reaction was stirred at room temperature overnight. The solvents were evaporated, and the residue was coevaporated with water (3×, 5 ml) yielding 135 mg of acrylate 40. The acrylate 40 was then treated with neat ethylenediamine (2 ml, excess) and two drops of TEA and heated to 100° C. After 1 hour the excess EDA was evaporated, yielding 146 mg of the free amine (quantitative). The crude residue was then co-evaporated with pyridine (3×, 5 ml, insoluble), resuspended in a mixture of pyridine and DMF and was cooled to 0° C. To this mixture was added TFA-imidazole (73.8 mg, 1.1 eq.). The reaction was then allowed to warm to room temperature and stirred overnight. An additional 1 eq. of TFA-imidazole was added at this time and the reaction was stirred an additional 15 minutes. The solvent was then evaporated, and the residue was co-evaporated with water(2×, 5 ml) and dissolved in 5 ml of water. The white precipitate that formed was removed by filtration. The mother liquor, which contained the TFA-protected nucleoside 3, was separated into two aliquots and purified by reverse phase HPLC. The fractions were then pooled and evaporated to yield 20% (35 mg) of pure 41, which was verified by ¹H NMR. Using standard procedures (eg., Prober, et al., EP 0252683), compound 41 was converted to the triphosphate, which was then conjugated to biotin and fluorescein to afford 42a and 42b.

Synthesis of the N1-labeled 2-amino-5-(β-D-ribofuranosyl)-4 (1H)-pyrimidinone, 55, involved alkylation at N1 using conditions similar to those described by Muehlegger, et al. (WO 96/28640) for the N1-alkylation of pyrazalo-[4,3-d]pyrimidines (Scheme 2).

The IVT incorporation efficiency (the number of labeled analogs incorporated per transcript) of the N1-fluorescein-X-5-(β-D-ribofuranosyl)-2,4 (1H,3H)-pyrimidinedione 5′-triphosphate 42a was measured by HPLC (diode array UV detection at 260 nm and 495 nm) in an IVT amplification of a 1.24 kb transcript. See U.S. patent application Ser. No. 09/126,645 for additional details on test methods used. Table 1 summarizes the data obtained using different ratios of UTP/5 At a ratio of 1:5, the incorporation and relative yield (measured relative to the yield obtained with UTP only) of transcript are optimal. This transcript was compared in a hybridization assay to transcript labeled using fluorescein. The preliminary results indicated that the N1-fluorescein-X-5-(β-D-ribofuranosyl)-2,4 (1H,3H)-pyrimidinedione 5′-triphosphate (42a) performed equivalently in a hybridization assay in terms of number of correct calls and in hybridization intensity (Charts 2 and 3). The hybridization assay used for this purpose was the Affymetrix HIV-PRT GeneChip assay (see Kozal, et al. Nature Medicine 1996, 2: 753-9.).

Similarly, the efficiency of DNA 3′-end labeling of a polythymidylate oligonucleotide (T₁₆) using terminal deoxynucleotidyl transferase and N1-fluorescein and biotin-labeled 5-(β-D-ribofiranosyl)-2,4 (1H,3H)-pyrimidinedione 5′-triphosphate, was determined by HPLC. In this analysis, the percent conversion of oligo-T₁₆ to the 3′-end labeled T₁₆-F1, is determined by AX-HPLC (see U.S. patent application Ser. No. 09/126,645 for detailed procedures). The data is summarized in Chart 4. The incorporation of the biotin and fluorescein triphosphates was very efficient as determined by HPLC.

CHART 1 Incorporation efficiency of N1-fluorescein-labeled 5-(β-D- ribofuranosyl)-2,4(1H,3H)-pyrimidinedione 5′-triphosphate 42a, determined by HPLC

The labeling reaction conditions are the standard conditions used in the Affymetrix HIV-PRT GeneChip product assay (see Kozal, et al. Nature Medicine 1996, 2: 753-9.).

CHART 2 Call accuracy of N1-fluorescein-labeled 5-(β-D-ribofuranosyl)- 2,4(1H,3H)-pyrimidinedione 5′-triphosphate 42a.

Data was obtained from Affymetrix HIV-PRT GeneChip hybridization assay (see Kozal, et al. Nature Medicine 1996, 2: 753-9.).

CHART 3 Hybridization signal of fluorescein labeled triphosphate 42a

Data obtained from hybridization of labeled transcript to the Affymetrix HIV-PRT GeneChip array (see Kozal, et al. Nature Medicine 1996, 2: 753-9.).

CHART 4 TdT labeling efficiency of Fluorescein and Biotin labeled 5-(β-D- ribofuranosyl)-2,4(1H,3H)-pyrimidinedione 5′-triphosphate 42a and 42b, determined by HPLC.

Reaction conditions: TdT (40 units), 20 uM U*TP and 3.2 uM T₁₆ oligo in 50 ul of water. Heated at 37° C. for 1 hour and 70° C. for 10 min., followed by 1 ul of 100 mM EDTA. HPLC analysis was performed on a Dionex DNAPac ™ PA-100 column.

Example 14 Synthesis of fluorescein derivatives of 2′-amino-2′-deoxyuridine triphosphate and 3′-amino-3′-deoxythymidinetriphosphate (Scheme 3)

To 0.5 mmoles (50 uL of a 10 mM solution) of the amino nucleotide triphosphate (1 or 2) in a 0.5 ml ependorf tube was added 25 ul of a 1 M aqueous solution of sodium borate, pH 8, 87 uL of methanol, and 88 uL (10 mmol, 20 equiv) of a 100 mM solution of 5-carboxyfluorescein —X—NHS ester in methanol. The mixture was vortexed briefly and allowed to stand at room temperature in the dark for 15 hours. The sample was then purified by ion-exchange HPLC to afford the fluoresceinated derivatives 3 or 4 in about 78-84% yield. Relative efficiencies of incorporation of these compounds by TdT are shown in Table 5.

TABLE 5 Incorporation of triphosphate compounds by TdT.

TdT Labeling Efficiencies % Labeled X (3′) Y (2′) B (1′b) 40 U 160 U OH H uracil 100.0 100.0 NH2 H thymine 100 100 NHCO(CH2)5NH—(CO—FL) H thymine 1.3 2.2 OH NH2 uracil 65 95 OH NHCO(CH2)5NH—(CO—FL) uracil 3.0 6.6 OH O(CH2)6NH—(CO—FL) uracil 2.5 7.0 OH O(CH2)6NHCO—(CH2)5—NHCO— uracil 15.0 17.0 Biotin OH NH(CH2)5CH3 uracil 4.5 5.0 OH H NHCO(CH2)5NH—(CO—FL) 45.0 55.0

Example 15 Synthesis of N-fluorescein-5-carboxamido)ethyl-3-deoxy allonamide-6-O-triphosphate (FIG. 17)

N1-(di-O-acetylfluorescein-5-carboxamido)ethyl-3-deoxy-4-O-acetyl-6-O-dimethoxytrityl allonamide 43 (U.S. patent application Ser. No. 08/574,461) was detritylated with 80% acetic acid, and the crude product was purified on a small silica gel column to obtain N1-(di-O-acetylfluorescein-5-carboxamido)ethyl-3-deoxy-4-O-acetyl allonamide 44. The allonamide was phosphorylated using POCl₃ followed by reaction with pyrophosphate (Bogachev, Russ. J. Bioorg. Chem. 1996, 22: 559-604). The crude product was treated with NH₄OH to remove the acetyl protecting groups, then purified using a preparative Source QTM AX-HPLC column. Pure fractions (analysed by analytical ion-exchange HPLC) were pooled and evaporated to near-dryness. The triphosphate salt 45a was precipitated with MeOH-acetone and dried under high vacuum to obtain a product which was 98% pure by ion-exchange HPLC and 31 p NMR.

Example 16 Synthesis of N-(6-(fluorescein-5-carboxamido)hexanoyl)-morpholino uridine triphosphate (Scheme 5)

Morpholino-uracil tosylate salt 1 (30 mg) was co-evaporated with pyridine (3×3 ml) and dissolved in 2 ml of pyridine and cooled to 0° C. Trifluoroacetic anhydride (30 uL) was added and stirred for 1 hour. The reaction was followed by HPLC until complete. The pyridine was removed and the residue was dissolved in 1 ml of water and filtered. The product was purified by HPLC on a Waters C-18 bondapak cartridge (Buffer: A=50 mM TEM pH 7.0; B=acetonitrile) using a gradient of 0-25% B in 30 minutes (retention time=22 min.). The product was desalted on a Sep-Pak cartridge and freezedried to give 151 mg of 2. Phosphorylation of 2 using the POCl₃ method gave 3. The removal of the trifluoroacetyl group with conc. NH4OH at 50° C. for 30 min to 4 followed by conjugation to 5-carboxyfluoroscein-aminocaproic acid N-hydroxysucciimide (F1-X—NHS) under standard conditions gave thE amide 5. The mass spectral and NMR data for compounds 1-5 were consistent with the proposed structures.

Example 17 Labeled N-(2-hydroxyethoxy)ethyl 2-O-triphosphates (Scheme 6)

Example 18 Labeled 2-(2-hydroxyethyl)acetamide 2-O-triphosphates (Scheme 7)

Example 19 Synthesis of N-alkyl 2′-amino-2′-deoxyuridine triphosphate (Scheme 8)

Example 20 Synthesis of 2′-O-(6-(Fluorescein-5-carboxamido)hexyl)uridine 5′-O-triphosphate (Scheme 9)

Example 21 Synthesis of 2′-S(N-(6-(Fluorescein-5-carboxamido)hexyl)-aminoethyldithiouridine 5′-O triphosphate (Scheme 8)

All patents, patent applications, and literature cited in the specification are hereby incorporated by reference in their entirety. In the case of any inconsistencies, the present disclosure, including any definitions therein will prevail.

The invention has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention. 

1. A nucleic acid labeling compound of the following structure:

wherein A is H, monophosphate, diphosphate, triphosphate, phosphoramidite ((R₂N)(R′O)P) wherein R is linear, branched or cyclic alkyl and R′ is a protecting group, or H-phosphonate (HP(O)O—HNR₃), wherein R is linear, branched or cyclic alkyl; X is O, S, NR₁ or CHR₂, wherein R₁ and R₂ are, independently, H, alkyl or aryl; Y is H, N₃, F, OR₉, SR₉ or NHR₉, wherein R₉ is H, alkyl or aryl; Z is H, N₃, F or OR₁₀, wherein R₁₀ is H, alkyl or aryl; L is an amido alkyl group; Q is a detectable moiety; and, M is a connecting group, and wherein n is an integer ranging from 0 to about
 3. 2. The nucleic acid labeling compound of claim 1, wherein A is H or H₄O₉P₁ ⁻ with an appropriate counterion.
 3. The nucleic acid labeling compound of claim 1, wherein X is O.
 4. The nucleic acid labeling compound of claim 1, wherein Y is OR₉, wherein R₉ is H, alkyl or aryl.
 5. The nucleic acid labeling compound of claim 1, wherein Z is OR₁₀, wherein R₁₀ is H, alkyl or aryl.
 6. The nucleic acid labeling compound of claim 1, wherein L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about
 10. 7. The nucleic acid labeling compound of claim 1, wherein M is —NH(CH₂)_(n)NH— or —C(O)(CH₂)_(n)NH— wherein n is and integer from about 0 to about
 5. 8. The nucleic acid labeling compound of claim 1, wherein Q is biotin.
 9. The nucleic acid labeling compound of claim 1, wherein A is H₄O₉P₃ ⁻ with an appropriate counterion; X is O; Y is H or OH; Z is H or OH; L is —C(O)NH(CH₂)₂NH—, M is —C(O)(CH₂)₅NH—; and Q is biotin.
 10. The nucleic acid labeling compound of claim 1, wherein the compound is a stereoisomer.
 11. The nucleic acid labeling compound of claim 10, wherein the compound has the following structure:


12. The nucleic acid labeling compound of claim 9, wherein the compound has the following structure:


13. The nucleic acid labeling compound of claim 11, wherein A is H or H₄O₉P₃ ⁻ with an appropriate counterion.
 14. The nucleic acid labeling compound of claim 11, wherein X is O.
 15. The nucleic acid labeling compound of claim 11, wherein Y is OR₉, wherein R₉ is H, alkyl or aryl.
 16. The nucleic acid labeling compound of claim 11, wherein Z is OR₁₀, wherein R₁₀ is H, alkyl or aryl.
 17. The nucleic acid labeling compound of claim 11, wherein L is —C(O)_(n)H(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about
 10. 18. The nucleic acid labeling compound of claim 11, wherein M is —NH(CH₂)_(n)NH— or —C(O)(CH2)_(n)NH— wherein n is and integer from about 0 to about
 5. 19. The nucleic acid labeling compound of claim 11, wherein Q is biotin.
 20. The nucleic acid labeling compound of claim 12, wherein A is H or H₄O₉P₃ ⁻ with an appropriate counterion.
 21. The nucleic acid labeling compound of claim 12, wherein X is O.
 22. The nucleic acid labeling compound of claim 12, wherein Y is OR₉, wherein R₉ is H, alkyl or aryl.
 23. The nucleic acid labeling compound of claim 12, wherein Z is OR₁₀, wherein R₁₀ is H, alkyl or aryl.
 24. The nucleic acid labeling compound of claim 12, wherein L is —C(O)NH(CH₂)_(n)NH—, wherein n is an integer ranging from about 2 to about
 10. 25. The nucleic acid labeling compound of claim 12, wherein M is —NH(CH₂)_(n)NH— or —C(O)(CH₂)_(n)NH— wherein n is and integer from about 0 to about
 5. 26. The nucleic acid labeling compound of claim 12, wherein Q is biotin.
 27. A nucleic acid derivative produced by coupling a nucleic acid labeling compound according to claim 11 with a nucleic acid.
 28. The nucleic acid derivative of claim 27, wherein the coupling comprises incubating with a terminal transferase.
 29. A hybridization product comprising the nucleic acid derivative according to claim 27 bound to a complementary probe.
 30. The hybridization product of claim 29, wherein the probe is attached to a solid support.
 31. The hybridization product of claim 30, wherein the solid support is glass.
 32. A nucleic acid derivative produced by coupling a nucleic acid labeling compound according to claim 12 with a nucleic acid.
 33. The nucleic acid derivative of claim 32, wherein the coupling comprises incubating with a terminal transferase.
 34. A hybridization product comprising the nucleic acid derivative according to claim 32 bound to a complementary probe.
 35. The hybridization product of claim 34, wherein the probe is attached to a solid support.
 36. The hybridization product of claim 35, wherein the solid support is glass.
 37. A method of synthesizing a nucleic acid derivative comprising coupling a nucleic acid labeling compound according to claim 11 to a nucleic acid.
 38. The method of claim 37, wherein the coupling is achieved using a terminal transferase.
 39. A method of detecting a nucleic acid comprising hybridizing a nucleic acid derivative according to claim 37 with a complementary probe.
 40. The method according to claim 39, wherein the probe is attached to a solid support.
 41. The method according to claim 40, wherein said solid support is glass.
 42. A method of synthesizing a nucleic acid derivative comprising coupling a nucleic acid labeling compound according to claim 12 to a nucleic acid.
 43. The method of claim 42, wherein the coupling is achieved using a terminal transferase.
 44. A method of detecting a nucleic acid comprising hybridizing a nucleic acid derivative according to claim 42 with a complementary probe.
 45. The method according to claim 44, wherein the probe is attached to a solid support.
 46. The method according to claim 45, wherein said solid support is glass. 